1 ttccggtttt tctcagggga cgttgaaatt atttttgtaa cgggagtcgg gagaggacgg
61 ggcgtgcccc gcgtgcgcgc gcgtcgtcct ccccggcgct cctccacagc tcgctggctc 
121 ccgccgcgga aaggcgtcat gccgcccaaa accccccgaa aaacggccgc caccgccgcc 
181 gctgccgccg cggaaccccc ggcaccgccg ccgccgcccc ctcctgagga ggacccagag 
241 caggacagcg gcccggagga cctgcctctc gtcaggcttg agtttgaaga aacagaagaa 
301 cctgatttta ctgcattatg tcagaaatta aagataccag atcatgtcag agagagagct 
541 ctaaaagaaa ttgataccag taccaaagtt gataatgcta tgtcaagact gttgaagaag
601 tatgatgtat tgtttgcact cttcagcaaa ttggaaagga catgtgaact tatatatttg 
661 acacaaccca gcagttcgat atctactgaa ataaattctg cattggtgct aaaagtttct 
721 tggatcacat ttttattagc taaaggggaa gtattacaaa tggaagatga tctggtgatt 
781 tcatttcagt taatgctatg tgtccttgac tattttatta aactctcacc tcccatgttg 
841 ctcaaagaac catataaaac agctgttata cccattaatg gttcacctcg aacacccagg 
901 cgaggtcaga acaggagtgc acggatagca aaacaactag aaaatgatac aagaattatt 
961 gaagttctct gtaaagaaca tgaatgtaat atagatgagg tgaaaaatgt ttatttcaaa 
1021 aattttatac cttttatgaa ttctcttgga cttgtaacat ctaatggact tccagaggtt 
1081 gaaaatcttt ctaaacgata cgaagaaatt tatcttaaaa ataaagatct agatgcaaga 
1141 ttatttttgg atcatgataa aactcttcag actgattcta tagacagttt tgaaacacag 
1201 agaacaccac gaaaaagtaa ccttgatgaa gaggtgaatg taattcctcc acacactcca 
1261 gttaggactg ttatgaacac tatccaacaa ttaatgatga ttttaaattc agcaagtgat 
1321 caaccttcag aaaatctgat ttcctatttt aacaactgca cagtgaatcc aaaagaaagt 
1381 atactgaaaa gagtgaagga tataggatac atctttaaag agaaatttgc taaagctgtg 
1441 ggacagggtt gtgtcgaaat tggatcacag cgatacaaac ttggagttcg cttgtattac 
1501 cgagtaatgg aatccatgct taaatcagaa gaagaacgat tatccattca aaattttagc 
1561 aaacttctga atgacaacat ttttcatatg tctttattgg cgtgcgctct tgaggttgta 
1621 atggccacat atagcagaag tacatctcag aatcttgatt ctggaacaga tttgtctttc 
1681 ccatggattc tgaatgtgct taatttaaaa gcctttgatt tttacaaagt gatcgaaagt 
1741 tttatcaaag cagaaggcaa cttgacaaga gaaatgataa aacatttaga acgatgtgaa 
1801 catcgaatca tggaatccct tgcatggctc tcagattcac ccttatttga tcttattaaa 
2101 tatcggctag cctatctccg gctaaataca ctttgtgaac gccttctgtc tgagcaccca
Leu *** Cys His Leu>
SP6 PROMOTER > 



>BanII 
I 

>BsiHKAI
I 

>SacI 
I 

>Ecl136II| >BclI
I I 

>BGH_POLY_A

1150 | 1160 | 1170 1180 1190 1200

* I * I I * 

AAT G CTAGAGCTCG CTGATCAGCC TCGACTGTGC CTTCTAGTTG CCAGCCATCT 
Asn> 
> 

>BanI 

I 

1210 1220 1230 1240 1250 1260 

* * * * * ★ * * * * * * 

GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG GTGCCACTCC CACTGTCCTT 

1270 1280 1290 1300 1310 1320 

* * * + * * * * * ★ *■ * 

TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA GGTGTCATTC TATTCTGGGG 

>BbsI 
I 

1330 1340 1350 1360 | 1370 1380 

* * ** * * * *• | * * * * 

GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG ACAATAGCCG AAATGACCGA 

>BssSI
>BspMI
I

1390 1400 I 1410 1420 1430 1440

** * * j* * * * ** * ■ * 

CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC ACCGCCGCCT TCTATGAAAG 

>NaeI 
I 

>BsrFI 

I I 
>BpmI | 



>NgoMI 
II' 

1450 1460 1470 I 1480 1490 1500 

+ * * * * |* | * * * * + * 

GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCTGGATG ATCCTCCAGC GCGGGGATCT 
>BpmI
I

>SV40_early_poly_A

1510 1520 |1530 1540 1550 1560

* * * * *j* * * * * + * ■ 

CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA GCTTATAATG GTTACAAATA 

>ApoI >BsmI 
I I 
1570 |1580 1590 1600 |1610 1620

* * *l + * * * * *!*.-■* * 
AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG 

>HincII 
1 

>Bst1107I >AccI
t I I 

>AccI >SalI 
II II! 

1630 1640 1650 1660 I I I 1670 1680

* * + * * ■ * * I I * I I* * * * 
TTTGTCCAAA CTCATCAATG TATCTTATCA TGTCTGTATA CCGTCGACCT CTAGCTAGAG

>BsrBI

I 

1690 1700 1710 1720 1730 1740 
+ * * * * * * * |* * * 
CTTGGCGTAA TCATGGTCAT AGCTGTTTCC TGTGTGAAAT TGTTATCCGC TCACAATTCC 
c PUC19 BACKBONE H3 TO AATII c > 

>BanI 
I 

1750 1760 1770 1780 1 1790 1800 

* * * * * * * * I * * * * 

ACACAACATA CGAGCCGGAA GCATAAAGTG TAAAGCCTGG GGTGCCTAAT GAGTGAGCTA 
c PUC19 BACKBONE H3 TO AATII c >

>AseI

1810 1820 1830 . 1840 1850 1860 

*l* * * ** ** ** 

ACTCACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG TCGGGAAACC TGTCGTGCCA 
c PUC19 BACKBONE H3 TO AATII c > 

>PvuII

>MspA1I >AseI >EaeI >HaeII
III ■ . t 
| 1870 |1880 1890 1900 1910 I 1920
l+l* * | * ** ** ** ** 
GCTGCATTAA TGAATCGGCC AACGCGCGGG GAGAGGCGGT TTGCGTATTG GGCGCTCTTC 
c PUC19 BACKBONE H3 TO AATII c > 

>EarI 

>SapI >BsiEI >BsrBI 

! 1930 1940 1950 I 1960 1970 1980 

* + ** ** ** 
CGCTTCCTCG CTCACTGACT CGCTGCGCTC GGTCGTTCGG CTGCGGCGAG CGGTATCAGC 

c PUC19 BACKBONE H3 TO AATII c > 
>Af

I 

1990 2000 2010 2020 2030 2040 

** * * + + * * * ★ *l* 

TCACTCAAAG GCGGTAATAC GGTTATCCAC AGAATCAGGG GATAACGCAG GAAAGAACAT 
c PUC19 BACKBONE H3 TO AATII c > 

2050 2060 2070 2080 2090 2100 

* * * * * * * * * * * * 

GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT 
c PUC19 BACKBONE H3 TO AATII c > 

>DrdI 
I 

2110 2120 2130 2140 I 2150 2160 

* * + ★ + * * * **■ + *- 

CCATAGGCTC CGCCCCCCTG ACGAGCATCA CAAAAATCGA CGCTCAAGTC AGAGGTGGCG 
c PUC19 BACKBONE H3 TO AATII c > 

>BssSI 
I 

2170 2180 2190 2200 2210 2220 

* * + * * * * * * * **■ 

AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC
c PUC19 BACKBONE H3 TO AATII c > 

>BsaWI
I 

2230 2240 I 2250 2260 2270 2280 

ir * * * | * * * * * * * ■ * 

TCCTGTTCCG ACCCTGCCGC TTACCGGATA CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT 
c PUC19 BACKBONE H3 TO AATII c > 

>HaeII >SfcI 

| I 

1 2290 2300 | 2310 2320 2330 2340 

+ + * * | * + ** * + + * 

GGCGCTTTCT CAATGCTCAC GCTGTAGGTA TCTCAGTTCG GTGTAGGTCG TTCGCTCCAA 
c PUC19 BACKBONE H3 TO AATII c >

>BsiHKAI >MspA1I

I ' I 

>ApaLI I >BsiEI >BsaWI 
*-|| t I I 

2350 I I 2360 2370 2380 2390 2400 

** * | |* *■ * ** 

GCTGGGCTGT GTGCACGAAC CCCCCGTTCA GCCCGACCGC TGCGCCTTAT CCGGTAACTA 
c PUC19 BACKBONE H3 TO AATII c > 

>AlwNI 
I 

2410 2420 2430 2440 2450 I 2460 
+ * ** * + 
TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA 
c PUC19 BACKBONE H3 TO AATII c > 

>SfcI 
I 

2470 2480 2490 I 2500 2510 2520 

** * * **|** ** ** 

CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 
c PUC19 BACKBONE H3 TO AATII c 
2530 2540 2550 2560 2570 2580

+ * * * * * * * + * * * 

CTACGGCTAC ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC CAGTTACCTT 
c PUC19 BACKBONE H3 TO AATII c > 

>Eco57I >MspA1I

i I 

| 2590 2600 2610 2620 I 2630 2640 

* * + * * * * + | * - + * * 

CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC ACCGCTGGTA GCGGTGGTTT 
c PUC19 BACKBONE H3 TO AATII c >

2650 2660 2670 2680 2690 2700 

** * * * * * * * * * * 

TTTTGTTTGC AAGCAGCAGA TTACGCGCAG AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT 
c PUC19 BACKBONE H3 TO AATII c > 

>BspHI 
I 

2710 2720 2730 2740 2750 2760 

* * • * * * * * * . * * *|* 

CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA CGTTAAGGGA TTTTGGTCAT 
c PUC19 BACKBONE H3 TO AATII c >

>DraI >DraI 

I I 

2770 2780 2790 |2800 2810 I 2820

+ * * * * * +|* * * * * 

GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA GTTTTAAATC 
c PUC19 BACKBONE H3 TO AATII c > 

>BanI 
I 

2830 2840 2850 2860 2870 2880 

* * * * * * * * * * * | * 

AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA TCAGTGAGGC 

a AMP-ORF >

c PUC19 BACKBONE H3 TO AATII c >

>AhdI
I 

2890 2900 2910 2920 2930 2940

* * + * + + * ** 

ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT GCCTGACTCC CCGTCGTGTA 

a a AMP-ORF a a > 

c PUC19 BACKBONE H3 TO AATII c > 



>BsaI 
I 

>BsrDI >BpmI
i I 

2950 2960 2970 2980 2990 I 3000 

+ + ** ** ** **|** 

GATAACTACG ATACGGGAGG GCTTACCATC TGGCCCCAGT GCTGCAATGA TACCGCGAGA
a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >



>BsrFI >BglI

I 1 - 

3010 3020 3030 3040 3050 3060 

** # * + * ** + * + + 

CCCACGCTCA CCGGCTCCAG ATTTATCAGC AATAAACCAG CCAGCCGGAA GGGCCGAGCG 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >
>AseI
I 

3070 3080 3090 3100 | 3110 3120 

* * * * * * * * j * * * * 

CAGAAGTGGT CCTGCAACTT TATCCGCCTC CATCCAGTCT ATTAATTGTT GCCGGGAAGC

a a AMP-ORF a a > 

c PUC19 BACKBONE H3 TO AATII c >

>Psp1406I
I

>FspI I >BsrDI >SfcI
I I I I

3130 3140 3150 I 3160 I 3170 | 3180

TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT GTTGCCATTG CTACAGGCAT 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >

>MslI >BsaWI 

I 3190 3200 3210 3220 | 3230 3240 

* * * * * * * - * | * * * * 

CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC TCCGGTTCCC AACGATCAAG 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c > 



>PvuI 
I 

>BsiEI 
I 

3250 3260 3270 3280 3290 3300 

*★ ** + * ** ** ** 

GCGAGTTACA TGATCCCCCA TGTTGTGCAA AAAAGCGGTT AGCTCCTTCG GTCCTCCGAT 

a a AMP-ORF a a > 

c PUC19 BACKBONE H3 TO AATII c > 



>EaeI >MslI 

i I 
3310 3320 3330 3340 I 3350 3360 

* * * t * ** **|** ** 

CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT ATCACTCATG GTTATGGCAG CACTGCATAA 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c > 



>ScaI 
I 

3370 3380 3390 3400 3410 3420 

* * ** ** ** ** ** 

TTCTCTTACT GTCATGCCAT CCGTAAGATG CTTTTCTGTG ACTGGTGAGT ACTCAACCAA

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >

>BsiEI 

3430 3440 3450 3460 3470 3480 

*★■** *j* ** ** ** 

GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT TGCCCGGCGT CAATACGGGA 

a a AMP-ORF a a > 

c PUC19 BACKBONE H3 TO AATII c >
>XmnI
I 

>DraI >BsiHKAI >Psp1406I
I I I 

3490 3500 3510 |3520 3530 3540

* * * * *|* *j* *j* * * 

TAATACCGCG CCACATAGCA GAACTTTAAA AGTGCTCATC ATTGGAAAAC GTTCTTCGGG 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c > 



>Eco57I



>ApaLI 
I 

>MspA1I >BssSI
I II 
3550 3560 I 3570 3580 3590 | 3600 

* * * * * * **|*|* 

GCGAAAACTC TCAAGGATCT TACCGCTGTT GAGATCCAGT TCGATGTAAC CCACTCGTGC

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >

>BsiHKAI 

[ 3610 3620 3630 3640 3650 3660 

I** * * * * * + + * * * 

ACCCAACTGA TCTTCAGCAT CTTTTACTTT CACCAGCGTT TCTGGGTGAG CAAAAACAGG

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c >

>MslI 
I 

3670 3680 3690 3700 3710 3720

** * * * + * * * * * * 

AAGGCAAAAT GCCGCAAAAA AGGGAATAAG GGCGACACGG AAATGTTGAA TACTCATACT 

a a AMP-ORF a a >

c PUC19 BACKBONE H3 TO AATII c > 

>EarI >SspI >BspHI >BsrBI 

II II 
| 3730 I 3740 3750 3760 I 3770 I 3780

* * | # * * * ★ ★ * * | * * 

CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT TGTCTCATGA GCGGATACAT 
c PUC19 BACKBONE H3 TO AATII c >

3790 3800 3810 3820 3830 3840 

** + + + * ** * * 

ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG CGCACATTTC CCCGAAAAGT 
c PUC19 BACKBONE H3 TO AATII c > 



>HincII 

i 

>AccI 
I I 

>AatII 
I I 

>SalI 
I I I 

3850 I I I 
* III 

GCCACCTGAC GTC 
c > 
>HincII
I 

>AccI 
I I 

>BglII >SalI 

I III 

10 t 20 30 I I I 40 50 60 

. * * j * * * * I I I * * * * * * 

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 

>AlwNI 

i 

70 80 90 100 110 120 

* * * * * * * * * * * * 

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 

>ApoI >MfeI 

I I 

| 130 140 150 . 160 | 170 180 

* j * * * * * * * | * + . * ★ 

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC

>HincII



>AflIII

>NruI >MluI

190 200 I 210 220 I 230

* * * * * | * * * * | *

TTAGGGTTAG GCGTTTTGCG CTGCTTCG CGA TGT ACG GGC CAG ATA TAC GCG TTG
Arg Cys Thr Gly Gln Ile Tyr Ala Leu>
e e CMV PROMOTER e e >

>SpeI >AseI

I I

240 250 I 260 270 280

* *|* *|** * * *

ACA TTG ATT ATT GAC TAG TTA TTA ATA GTA ATC AAT TAC GGG GTC ATT
Thr Leu Ile Ile Asp *** Leu Leu Ile Val Ile Asn Tyr Gly Val Ile>
e e e e e e CMV PROMOTER e e e e e e >

290 300 310 320 330

* * * ******* 

AGT TCA TAG CCC ATA TAT GGA GTT CCG CGT TAC ATA ACT TAC GGT AAA 
Ser Ser *** Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly Lys>
e e e e e e CMV PROMOTER e e e e e e >

>BglI >AatII 

I I 

340 350 360 370 I 

★ ** **** + * 

TGG CCC GCC TGG CTG ACC GCC CAA CGA CCC CCG CCC ATT GAC GTC AAT 
Trp Pro Ala Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val Asn>
e e e e e e CMV PROMOTER e e e e e e >

380 390 400 410 420 

* ** * ****** 

AAT GAC GTA TGT TCC CAT AGT AAC GCC AAT AGG GAC TTT CCA TTG ACG 
Asn Asp Val Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu Thr> 
e e e e e e CMV PROMOTER e e e e e e > 
>AatII >BglI
I I
430 440 450 460 I 470

I * * * * * *■ + + * ★ 

TCA ATG GGT GGA CTA TTT ACG GTA AAC TGC CCA CTT GGC AGT ACA TCA 

Ser Met Gly Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr Ser> 

e e e e e e CMV PROMOTER e e e e e e >

>NdeI >AatII

! " 
480 i 490 500 510 I 520 

* I* * * * * * | * ■ * 

AGT GTA TCA TAT GCC AAG TAC GCC CCC TAT TGA CGT CAA TGA CGG TAA 

Ser Val Ser Tyr Ala Lys Tyr Ala Pro Tyr *** Arg Gln *** Arg ***>

e e e e e e CMV PROMOTER e e e e e e >



>BglI 

530 ! 540 550 560 570 

* *l* * * *** * * * 

ATG GCC CGC CTG GCA TTA TGC CCA GTA CAT GAC CTT ATG GGA CTT TCC 
Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met Gly Leu Ser> 
e e e e e e CMV PROMOTER e e e e e e >

>BsaAI >NcoI 

I ! 

>SnaBI >StyI >MslI 

! II 
580 590 600 610 | 
*** ****** 
TAC TTG GCA GTA CAT CTA CGT ATT AGT CAT CGC TAT TAC CAT GGT GAT
Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr His Gly Asp>
e e e e e e CMV PROMOTER e e e e e e >



620 630 640 650 660 

+ * * * * + * 

GCG GTT TTG GCA GTA CAT CAA TGG GCG TGG ATA GCG GTT TGA CTC ACG 
Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val *** Leu Thr>
e e e e e e CMV PROMOTER e e e e e e >

>AatII >BanI 
I ! 
670 680 690 I 700 710 I 
* + ■ * * * * I * * * * 
GGG ATT TCC AAG TCT CCA CCC CAT TGA CGT CAA TGG GAG TTT GTT TTG 
Gly Ile Ser Lys Ser Pro Pro His *** Arg Gln Trp Glu Phe Val Leu>
e e e e e e CMV PROMOTER e e e e e e > 



720 730 740 750 760 

* * * * * * * * . * 

GCA CCA AAA TCA ACG GGA CTT TCC AAA ATG TCG TAA CAA CTC CGC CCC 
Ala Pro Lys Ser Thr Gly Leu Ser Lys Met Ser *** Gln Leu Arg Pro>
e e e e e e CMV PROMOTER e e e e e e >

770 780 790 800 810 

* * * * * * * * 



ATT GAC GCA AAT GGG CGG TAG GCG TGT ACG GTG GGA GGT CTA TAT AAG 
Ile Asp Ala Asn Gly Arg *** Ala Cys Thr Val Gly Gly Leu Tyr Lys>
e e e e e e CMV PROMOTER e e e e e e >
>SacI

>BanII
I

>BsiHKAI
I

>Ecl136II
I I
I 820 830 840 850
* I I * *

CAG AGC TCT CTG GCT AAC TAG AGA ACC CAC TGC TTA CTG GCT TAT CGA
Gln Ser Ser Leu Ala Asn *** Arg Thr His Cys Leu Leu Ala Tyr Arg>
e e e e e e CMV PROMOTER e e e e e e >

>KpnI
>AseI
>T7 PROMOTER
>BsaI
>SfcI
>HindIII
>Acc65I
>BanI

860 870 880 890 900 910

AAT T AATACGA CTCACTATAG GGAGACCCAA GCTTCGCGCG GGTACCACTC 
Asn Xxx> 



e_> 

>EarI
>PflMI
>PvuII
>BanII

920 930 940 950 960 970

TCTTCCGCAT CGCTGTCTGC GAGGGCCAGC TGTTGGGCTC GCGGTTGAGG ACAAACTCTT 
f f TRIPARTITE LEADER f f > 



>EarI
>ScaI

980 990 1000 1010 1020 1030

CGCGGTCTTT CCAGTACTCT TGGATCGGAA ACCCGTCGGC CTCCGAACGG TACTCCGCCA
f TRIPARTITE LEADER f

>EcoO109I
>BsiEI
>PpuMI
>BsoBI
>AvaI
>XhoI
>PaeR7I
>BsaWI

1040 1050 1060 1070 1080 1090

CCGAGGGACC TGAGCGAGTC CGCATCGACC GGATCGGAAA ACCTCTCGAG GAACTGAAAA
TRIPARTITE LEADER
>HincII >EcoO109I >BsaWI

I ! f 

>HpaI >PpuMI >BamHI 

.1 111 
1100 | 1110 1120 1130 1140 I 1150 
* *l* * * * + * *j* |* I* 
ACCAGAAAGT TAACTGGTAA GTTTAGTCTT TTTGTCTTTT TATTTCAGGT CCCGGATCCG 
b HYBRID SV40 LATE INTRON b >

>BseRI >StuI

I I 

1160 |1170 1180 1190 1200 | 1210

**.*!* * * + + * + * * 

GTGGTGGTGC AAATCAAAGA ACTGCTCCTC AGTGGATGTT GCCTTTACTT CTAGGCCTGT 
b HYBRID SV40 LATE INTRON b >

>BsiEI 
I 

>EagI| 
I I 

>EaeI| >XbaI 

I I I 
>SacII >PstI 

II II 
>NotI I >SfcI I I 

II I II 

1220 1230 1240 1250 I 1260 I 1 1270 
* * * * * * * * I * I * I * 
ACGGAAGTGT TACTTCTGCT CTAAAAGCTG CGGAATTGTA CCCGCGGCCG CTGCAGTCTA 
HYBRID SV40 LATE INTRON b >

>ApaI 
I 

>BspDI >EcoO109I
I I I

>ApoI >EcoRV| >Bsp120I

I t I 111 , 

>EcoRI >BsiWI >ClaI |>BanII >SfcI >MslI 
I I I I I I I 1 > 

| 1280 I 1290 I I 1300 I 1310 1320

I* * , * | * * | I* * * * | * 

GACGAATTCG CGTACGATAT CGATGGGCCC TATT CTA TAG TGT CAC CTA AAT 

Leu *** Cys His Leu Asn>

c SP6 PROMOTER c >

>SacI 
I 

>BanII 
I 

>BsiHKAI 
I 

>Ecl136II I >BclI
I I I 
> BGH_POLY_A | I 

I 1330 | I 1340 1350 1360 1370 1380

1+1*1** * * * * + * * * 

GCTAGAGC TCGCTGATCA GCCTCGACTG TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT 

>BanI 

1390 1400 1410 I 1420 1430 1440 

* * ** ± * | * * * * * + 

GCCCCTCCCC CGTGCCTTCC TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT 



1450 1460 1470 1480 1490 1500 

* * + * * * * ★ * * * * 

AAAATGAGGA AATTGCATCG CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG 

>BbsI 
I 

1510 1520 1530 1540 1550 1560 

* * * * ** *l* * * * + 

TGGGGCAGGA CAGCAAGGGG GAGGATTGGG AAGACAATAG CCGAAATGAC CGACCAAGCG 

>BspMI 
I 

>BssSI 
I 



1570 1580 1590 1600 1610 1620

* * * j ★ * * 

ACGCCCAACC TGCCATCACG AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC 

>NaeI 
I 

>NgoMI 

! 1 
>BpmI 

I t 
>BsrFI 
I I 

1630 1640 | I 1650 1660 1670 1680 

* * * * I I * * * * * * * * 
TTCGGAATCG TTTTCCGGGA CGCCGGCTGG ATGATCCTCC AGCGCCGGGA TCTCATGCTG 

>BpmI 
I 

>SV40_early_poly_A
I 

1690 1700 1710 1720 1730 1740 

* * *■*■ * * * * 

GAGTTCTTCG CCCACCCCAA CTTGTTTATT GCAGCTTATA ATGGTTACAA ATAAAGCAAT 

>ApoI >BsmI 

I I 

1750 1760 1770 1780 1790 1800 

*l* * + * * * ■ | * * * * * 

AGCATCACAA ATTTCACAAA TAAAGCATTT TTTTCACTGC ATTCTAGTTG TGGTTTGTCC

>HincII 
I 

>Bst1107I >AccI
>AccI
>SalI

1810 1820 1830 1840 1850 1860

AAACTCATCA ATGTATCTTA TCATGTCTGT ATACCGTCGA CCTCTAGCTA GAGCTTGGCG 



>BsrBI

1870 1880 1890 1900 1910 1920

TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT TCCACACAAC 
d d PUC19 BACKBONE d d > 
>BanI
I 

1930 1940 1950 I 1960 1970 1980 

* * * * * * |* * * + * * 

ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA 
d d PUC19 BACKBONE d d > 

>AseI 
I 

>AseI >PvuII

! 1990 2000 2010 2020 2030 I 2040 

* * * * * * * * 

TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT 
d d PUC19 BACKBONE d d > 

>EarI 
I 

>EaeI >HaeII >SapI 

1 ' 1 

2050 2060 2070 2080 2090 I 2100 

I* * * * * * * *,*[** 

TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC 
d d PUC19 BACKBONE d d > 

>BsiEI >BsrBI 

I I 

2110 2120 12130 2140 I 2150 2160 

* * * * * I * * * I * * * . * 
TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA 

d d PUC19 BACKBONE d d > 

>AflIII

i 

2170 2180 2190 2200 2210 2220 

** *.* * * * * * * * * 

AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA 
d d PUC19 BACKBONE d d > 

2230 2240 2250 2260 2270 2280

+ ** * + ** 

AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG
d d PUC19 BACKBONE d d > 

>DrdI 
I 

2290 2300 2310 2320 2330 2340 

** ** ** *|* ** 

CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 
d d PUC19 BACKBONE d d >

>BssSI 
I 

2350 2360 2370 2380 | 2390 2400 

+ * ** * .* *+ (** ** 

ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 
d d PUC19 BACKBONE d d > 
>BsaWI >HaeII

I I 

2410 |2420 2430 2440 2450 2460

* * * * ** ** ** .# | + 

CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 
d d PUC19 BACKBONE d d >



>SfcI 

2470 ! 2480 2490 2500 . 2510 2520 

* + * * * * + * * * * * 

TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC 
d d PUC19 BACKBONE d d > 

>BsiHKAI

>ApaLI| >BsiEI >BsaWI 

! 2530 2540 2550 2560 I 2570 2580

I* I* * * * * **!★* * * 

TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 
d d PUC19 BACKBONE d d >

>AlwNI 
I 

2590 2600 2610 2620 |2630 2640

+ * * * + * * * *j* * * 

GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
d d PUC19 BACKBONE d d > 

>SfcI 
I 

2650 2660 |2670 2680 2690 2700
** * * * | * * * * * * * 
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACTACGGC 
d d PUC19 BACKBONE d d > 

>Eco57I 

"1 

2710 2720 2730 2740 2750 2760
* * . * * * * * * * I * 
TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 
d d PUC19 BACKBONE d d > 



* 



2770 2780 2790 2800 2810 2820 

* ** * * ** ** ** 

AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT
d d PUC19 BACKBONE d d > 



2830 2840 2850 2860 2870 2880 

★ ** * * ** + * ** 

TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT 
d d PUC19 BACKBONE d d > 

>BspHI 
I 

2890 2900 2910 2920 2930 2940 

+ + * * ** ** * + ** 

ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
d d PUC19 BACKBONE d d > 

FIG. 6 
>DraI >DraI
I I 
2950 2960 2970 2980 2990 3000 

* * * * * i* * * + | * * * 
TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 

d d PUC19 BACKBONE d d > 

>BanI 

I 

3010 3020 3030 3040 3050 | 3060 

* * * * * + * * **|** 

AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 

a AMP-ORF a >

d d PUC19 BACKBONE d d >

>AhdI

I

3070 3080 3090 3100 I 3110 3120

* * * * * * + * j * * * * 

TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 
a a AMP-ORF a a >

d d PUC19 BACKBONE d d >



>BsaI 
I 

>BsrDI >BpmI 
I I 

3130 3140 3150 3160 I 3170 I 3180

** * * * * 

ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGACCCACGC 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d > 



>BsrFI >BglI

I 3190 3200 3210 3220 | 3230 3240 

I** ** ** 

TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 
a a AMP-ORF a a > 

d d PUC19 BACKBONE d d > 

>AseI 
I 

3250 3260 3270 I 3280 3290 3300

** + * ** ** ** ** 

GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d >

>Psp1406I
I 

>FspI I >BsrDI >SfcI >MslI 

I I I I I 

3310 3320 I 3330 3340 I 3350 I 3360 

** ** * |* *|*|** + * 

AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d > 
>BsaWI
I 

3370 3380 3390 I 3400 3410 3420 

* * * * **)★* + * * * 

TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT 

a a AMP-ORF a a >

d d PUC19 BACKBONE d d >



>BsiEI 
1 

>PvuI 

I 

3430 3440 3450 3460 3470 I 3480 

* * * * + * * * 

ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d > 

>EaeI >MslI

| I 

3490 I 3500 3510 I 3520 3530 3540 

* * I * * **|** * * * * 

AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d >

>ScaI 
I 

3550 3560 3570 3580 I 3590 3600

* * * * * * * * j * * + * 

ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d > 

>BsiEI 

3610 3620 i 3630 3640 3650 3660 

* * * * * * * * 

TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 
a a AMP-ORF a a >

d d PUC19 BACKBONE d d >



>Psp1406I
I

>DraI >BsiHKAI >XmnI

I t t 

3670 3680 3690 3700 I 3710 3720 

* + * * *l* **|** ** 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 

a a AMP-ORF a a > 

d d PUC19 BACKBONE d d >

>ApaLI 
1 

>Eco57I 
I 

>BssSI I >BsiHKAI 
i I I 

3730 3740 3750 3760 3770 I 3780 

* * * * * * * * I * I * 
CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 

a a AMP-ORF a a >

d d PUC19 BACKBONE d d >
3790 3800 3810 3820 3830 3840

* * * + * * * * , * * * * 

TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 

a a AMP-ORF a a >

d d PUC19 BACKBONE d d >

>MslI >EarI 
i i 
3850 3860 3870 I 3880 3890 3900 

* * I * * * * * I * 

AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 

a a AMP-ORF a >

d d PUC19 BACKBONE d d >

>SspI >BspHI >BsrBI 

I II 

3910 3920 3930 3940 I 3950 3960 

* , * * * * * * I * |* * * * 
TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 

d d PUC19 BACKBONE d d >

3970 3980 3990 4000 4010 4020 

* * * * * * * * * * + * 

TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 
d d PUC19 BACKBONE d d >

>HincII 
I 

>AatII 
I I 

>AccI 
I I 

>SalI 
III 
1* I 
GACGTC 

> 
>HincII
I 

>AccI 
I f " 

>BglII >SalI 
| II! 
10 | 20 30 III 40 50 60

* * ( * * * * I I I* * * * * 
GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 

>AlwNI 

70 80 90 100 110 120 

* * * * * * * * + * * * 

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 

>ApoI >MfeI

I I 

I 130 140 150 160 I 170 180

* -I * * * ' * * * * I * * * 
CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 

>HincII 

>AflIII
I 

>NruI >MluI 
I I 
190 200 I 210 220 I 230

* * * * *j* * * * I * . . 
TTAGGGTTAG GCGTTTTGCG CTGCTTCG CGA TGT ACG GGC CAG ATA TAC GCG TTG 

Arg Cys Thr Gly Gln Ile Tyr Ala Leu>
f f CMV PROMOTER f f >

>SpeI >AseI 

240 250 I 260 270 280 

* *j* * | * * * * * 
ACA TTG ATT ATT GAC TAG TTA TTA ATA GTA ATC AAT TAC GGG GTC ATT 
Thr Leu Ile Ile Asp *** Leu Leu Ile Val Ile Asn Tyr Gly Val Ile>

f f f f f f CMV PROMOTER f f f f f f >

290 300 310 320 330 
* . * * * * * 
AGT TCA TAG CCC ATA TAT GGA GTT CCG CGT TAC ATA ACT TAC GGT AAA 
Ser Ser *** Pro Ile Tyr Gly Val Pro Arg Tyr Ile Thr Tyr Gly Lys>
f f f f f f CMV PROMOTER f f f f f f >

>BglI >AatII 

| I 
340 350 360 370 1 
********* 
TGG CCC GCC TGG CTG ACC GCC CAA CGA CCC CCG CCC ATT GAC GTC AAT 
Trp Pro Ala Trp Leu Thr Ala Gln Arg Pro Pro Pro Ile Asp Val Asn>
f f f f f f CMV PROMOTER f f f f f f >

380 390 400 410 420 

** * ** * * _ * 

AAT GAC GTA TGT TCC CAT AGT AAC GCC AAT AGG GAC TTT CCA TTG ACG 
Asn Asp Val Cys Ser His Ser Asn Ala Asn Arg Asp Phe Pro Leu Thr> 
f f f f f f CMV PROMOTER f f f f f f > 
>AatII >BglI

| I 

430 440 450 460 | 470 

* * * * . * *(* * * 

TCA ATG GGT GGA CTA TTT ACG GTA AAC TGC CCA CTT GGC AGT ACA TCA 
Ser Met Gly Gly Leu Phe Thr Val Asn Cys Pro Leu Gly Ser Thr Ser> 
f f f f f f CMV PROMOTER f f f f f f >

>NdeI >AatII 

I I 
480 I 490 500 510 I 520 
* |* * * +_. _■* * | . * * 
AGT GTA TCA TAT GCC AAG TAC GCC CCC TAT TGA CGT CAA TGA CGG TAA 
Ser Val Ser Tyr Ala Lys Tyr Ala Pro Tyr *** Arg Gln *** Arg ***>
f f f f f f CMV PROMOTER f f f f f f >

>BglI 

530 ! 540 550 560 570 

* * I * * * * *■ * + * 

ATG GCC CGC CTG GCA TTA TGC CCA GTA CAT GAC CTT ATG GGA CTT TCC 
Met Ala Arg Leu Ala Leu Cys Pro Val His Asp Leu Met Gly Leu Ser> 
f f f f f f CMV PROMOTER f f f f f f >

>BsaAI >StyI 

I ! 
>SnaBI >NcoI >MslI 

I I I 
580 590 600 610 I 
* * * + + * * * * 
TAC TTG GCA GTA CAT CTA CGT ATT AGT CAT CGC TAT TAC CAT GGT GAT
Tyr Leu Ala Val His Leu Arg Ile Ser His Arg Tyr Tyr His Gly Asp>
f f f f f f CMV PROMOTER f f f f f f > 



620 630 640 650 660 

+ * * * * * * * * 

GCG GTT TTG GCA GTA CAT CAA TGG GCG TGG ATA GCG GTT TGA CTC 
Ala Val Leu Ala Val His Gln Trp Ala Trp Ile Ala Val *** Leu
f f f f f f CMV PROMOTER f f f f f f 



>AatII >BanI 
I * I 

670 680 690 I 700 710 I 

* * * * * * I * * * * 

GGG ATT TCC AAG TCT CCA CCC CAT TGA CGT CAA TGG GAG TTT GTT TTG 
Gly Ile Ser Lys Ser Pro Pro His *** Arg Gln Trp Glu Phe Val Leu>
f f f f f f CMV PROMOTER f f f f f f >

720 730 740 750 760 

* ** * ** 

GCA CCA AAA TCA ACG GGA CTT TCC AAA ATG TCG TAA CAA CTC CGC CCC 
Ala Pro Lys Ser Thr Gly Leu Ser Lys Met Ser *** Gln Leu Arg Pro>
f f f f f f CMV PROMOTER f f f f f f >

770 780 790 800 810 

* ** **** *** 

ATT GAC GCA AAT GGG CGG TAG GCG TGT ACG GTG GGA GGT CTA TAT AAG 
Ile Asp Ala Asn Gly Arg *** Ala Cys Thr Val Gly Gly Leu Tyr Lys>
f f f f f f CMV PROMOTER f f f f f f >
>BsiHKAI
>SacI
>BanII
>Ecl136II

820 830 840 850

CAG AGC TCT CTG GCT AAC TAG AGA ACC CAC TGC TTA CTG GCT TAT CGA 
Gln Ser Ser Leu Ala Asn *** Arg Thr His Cys Leu Leu Ala Tyr Arg>
f f f f f f CMV PROMOTER f f f f f f > 



>KpnI
>AseI
>T7_PROMOTER
>BsaI
>SfcI
>HindIII
>BanI
>Acc65I

860 870 880 890 900 910

AAT T AATACGA CTCACTATAG GGAGACCCAA GCTTCGCGCG GGTACCACTC 
Asn Xxx> 



>EarI
>PflMI
>PvuII
>BanII

920 930 940 950 960 970

TCTTCCGCAT CGCTGTCTGC GAGGGCCAGC TGTTGGGCTC GCGGTTGAGG ACAAACTCTT 
g TRIPARTITE LEADER SEQUENCE g > 



>EarI
>ScaI

980 990 1000 1010 1020 1030

CGCGGTCTTT CCAGTACTCT TGGATCGGAA ACCCGTCGGC CTCCGAACGG TACTCCGCCA 
g TRIPARTITE LEADER SEQUENCE g > 

>XhoI
>EcoO109I
>PpuMI
>BsiEI
>BsaWI
>AvaI
>BsoBI
>PaeR7I

1040 1050 1060 1070 1080 1090

CCGAGGGACC TGAGCGAGTC CGCATCGACC GGATCGGAAA ACCTCTCGAG GAACTGAAAA
TRIPARTITE LEADER SEQUENCE



>HpaI >PpuMI 

I I 

>HincII >EcoO109I

I I 
1100 I 1110 1120 1130 1140 1150 

* *|* * + * * * * | * ■ * . * 

ACCAGAAAGT TAACTGGTAA GTTTAGTCTT TTTGTCTTTT TATTTCAGGT CCCGGATCTG 
b HYBRID SV40 LATE INTRON b b >

>Ppu10I
I 

>21 bp tandem repeat III [ 110] ,[ 102] ,[ 112] I 
" " * ' T " I 

1160 1170 1180 |1190 1200 1210

* * * * * * *|* **_*(* 

AGTTAGGGCG GGACATGGGC GGAGTTAGGG GCGGGACTAT GGTTGCTGAC TAATTGAGAT 
< h h EARLY MRNA h

>SphI 
I 

>NsiI 

j <72 bp_tandem_repeat_enhancer_sequence_ 

| 1220 1230 1240 1250 1260 | 1270 

* + * * * * * * * * I * * 

GCATGCTTTG CATACTTCTG CCTGCTGGGG AGCCTGGGGA CTTTCCACAC CTGGTTGCTG
h h EARLY MRNA h h 

>NsiI 
I

>Ppu10I |>SphI

1280 | ! 1290 1300 1310 1320 1330 

* * * * *.* * * * 
ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG GACTTTCCAC 

< h h EARLY MRNA h h 

>PvuII >BsaWI >BseRI 

I I I 

<72 bp tandem__repeat_enhancer_sequence_B_ 

<T_antigen_binding_site_II I

| III i 

| 1340 1350 I 1360 |1370 1380 1390

■j * * * * * + * I * * * * I * 

ACCCTAACTG ACACACATTC CACAGCTGGT TCTTTCAGAT CCGGTGGTGG TGCAAATCAA 

HYBRID SV40 > 

< h EARLY MRNA 



< 



MINOR LATE 19S 



>StuI 



1400 1410 1420 1430 1440 1450 

* * * * *,+ *|* * * * + 

AGAACTGCTC CTCAGTGGAT GTTGCCTTTA CTTCTAGGCC TGTACGGAAG TGTTACTTCT 

c HYBRID SV40 LATE INTRON c >
>BsiEI
>NotI
>EaeI
>SacII
>EagI

1460 1470 1480 I I 1490 1500 I 1510

I* I * * I * .1* 

GCTCTAAAAG CTGCGGAATT GTACCCGCGG CCGCTGCAGT CTAGACGAAT TCGCGTACGA
HYBRID SV40 LATE INT >



>XbaI
>PstI
>SfcI
>EcoRI
>ApoI >BsiWI
>ApaI
>BspDI >BanII
>ClaI >EcoO109I
>EcoRV
>Bsp120I
>SfcI
>MslI
>SacI
>BsiHKAI
>BanII
>Ecl136II >BclI
>BGH POLY A

1520 1530 1540 1550 1560

. , * I I* I * . 

TATCGATGGG CCCTATT CTA TAG TGT CAC CTA AAT GCTAG AGCTCGCTGA 

Leu *** Cys His Leu Asn> 
d SP6 PROMOTER d >

1570 1580 1590 1600 1610 1620 

* * * * * . * * ★ * * * * 

TCAGCCTCGA CTGTGCCTTC TAGTTGCCAG CCATCTGTTG TTTGCCCCTC CCCCGTGCCT 
>BanI 

1630 | 1640 1650 1660 1670 1680

, * * +j* * ■ * * * * * * * 

TCCTTGACCC TGGAAGGTGC CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA 

1690 1700 1710 1720 1730 1740 

+ * * * * * * * * * * * 

TCGCATTGTC TGAGTAGGTG TCATTCTATT CTGGGGGGTG GGGTGGGGCA GGACAGCAAG 

>BspMI 
I 

>BbsI >BssSI 
1 i 
1750 1760 1770 1780 1790 1800 

** * * + * + * * * **■ 

GGGGAGGATT GGGAAGACAA TAGCCGAAAT GACCGACCAA GCGACGCCCA ACCTGCCATC



1810 1820 1830 1840 1850 1860



ACGAGATTTC GATTCCACCG CCGCCTTCTA TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG 
>NaeI
I 

>BpmI 

I I 
>BsrFI 

I I 
>NgoMI

! 1870 1880 1890 1900 1910 1920 

* j + * * * * + * * * * * 

GGACGCCGGC TGGATGATCC TCCAGCGCGG GGATCTCATG CTGGAGTTCT TCGCCCACCC 

>BpmI >ApoI
I 

>SV40_early_poly_A

| 1930 1940 1950 1960 1970 I 1980

I * * + * * * * * * * 

CAACTTGTTT ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGCATCA CAAATTTCAC 

>BsmI 

1990 2000 I 2010 2020 2030 2040 

* * * *|* * * * * . * * • * 
AAATAAAGCA TTTTTTTCAC TGCATTCTAG TTGTGGTTTG TCCAAACTCA TCAATGTATC 

>HincII 
I 

>Bst1107I >AccI
I I I 

>AccI >SalI 

II 111 „ . 

2050 1 I 2060 I . 2070 2080 2090 2100 

* * | | * | * | * * * * * * * * 
TTATCATGTC TGTATACCGT CGACCTCTAG CTAGAGCTTG GCGTAATCAT GGTCATAGCT 

PUC19 BACKBONE > 

>BsrBI 
I 

2110 2120 I 2130 2140 2150 2160 

+ * ~ * * *,* * * *. * * * 

GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT 
e e PUC19 BACKBONE e e > 

>BanI >AseI 
1 I 
2170 2180 2190 2200 j 2210 2220 
+ * . * j * * * * * |* * * * 
AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC 
e e PUC19 BACKBONE e e > 

>PvuII >AseI >EaeI 
I I I 

2230 2240 2250 2260 ! 2270 | 2280 

* * * * * * * I * I * *!* * 
ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG 

e e PUC19 BACKBONE e e > 
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>SapI 
I 

>HaeII >EarI 
I t 

2290 2300 2310 | 2320 2330 2340 

* * * * * * j * | * * * * * 

CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
e e PUC19 BACKBONE e e > 

>BsiEI >BsrBI 

2350 2360 I 2370 2380 2390 2400
I* * * * | * * * * * 
GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT 
e e PUC19 BACKBONE e e > 

>AflIII. 

2410 2420 • 2430 I 2440 2450 2460 

** * . * **(** * * * * 

ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC AGCAAAAGGC 
e e PUC19 BACKBONE e e > 

2470 2480 2490 2500 2510 2520 ' 

* * . * * * + * * . * * * * 

CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 
e e PUC19 BACKBONE e e > 

>DrdI 

2530 2540 ! 2550 2560 2570 2580 

* * + * * * * * * 

GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 
e e PUC19 BACKBONE e e > 

>BssSI >BsaWI 
I I 
2590 2600 I 2610 2620 2630 2640

* * * * | * * * * * * 1* 
CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 

e e PUC19 BACKBONE e e > 

>HaeII >SfcI 

I I 

2650 2660 2670 2680 I 2690 2700 

* * * • * * * * I * * * I * 
CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 

e e PUC19 BACKBONE e e > 

>BsiHKAI
>ApaLI 

I 

2710 2720 2730 2740 2750 2760



TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 
e e PUC19 BACKBONE e e > 

FIG. 8 

(CONTINUED) 
>BsiEI >BsaWI

I t 

2770 ! 2780 | 2790 2800 2810 2820 

+ *!** +|* * * * * + * 

CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG 
e e PUC19 BACKBONE e e >

>AlwNI 
I 

2830 2840 2850 2860 - 2870 2880 

** + + +1* * * * * * * 

ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 
e e PUC19 BACKBONE e e > 

>SfcI 

2890 2900 2910 2920 2930 2940 
*j* * * * * * * * * * * 
AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT
e e PUC19 BACKBONE e e > 

>Eco57I 
I 

2950 2960 2970 2980 I 2990 3000 
** * * * * * * 
" ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
e e PUC19 BACKBONE e e > 

3010 3020 3030 3040 3050 3060 

** * * * * * * * * * + 

ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 
e e PUC19 BACKBONE e e > 

3070 3080 3090 3100 3110 3120 

GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA 
e e PUC19 BACKBONE e e > 

>BspHI 
f 

3130 3140 3150 i 3160 3170 3180 

| * * * * 
GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 
e e PUC19 BACKBONE e e > 

>DraI >DraI 
| I 
3190 | 3200 3210 I 3220 3230 3240 

CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC 
e e PUC19 BACKBONE e e > 

>BanI 
I 

3250 3260 3270 I 3280 3290 3300 

** ** *■*(** ** ** 

TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT 

a a AMP-ORF_a a 

e e PUC19 BACKBONE e e 



>AhdI
I 

3310 3320 I 3330 3340 3350 3360

^ * * * *j* * * * + ★ + 

TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 



>BsaI 
I 

>BsrDI >BpmI >BsrFI 

t I 1 

3370 3380 3390 | 3400 I 3410 3420

★ * + * *|* *j* *|* * * 

ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 



>BglI 
I 

3430 3440 | 3450 3460 3470 3480

★ * * * *|* * + * * * ★ 

ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 



>AseI 
I 

3490 3500 3510 3520 " 3530 3540 

** *|* * * * * * * + * 

CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 



>Psp1406I

>FspI I >BsrDI >SfcI >MslI 

It I I i 

3550 | 3560 I 3570 3580 3590 3600 

*|*|* *|* * * * + 

TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e >



>BsaWI 

3610 3620 3630 3640 3650 3660 

* + * | * ** * * ** + * 

TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 



>BsiEI 
i 

>PvuI >EaeI 
I I 
3670 3680 3690 I 3700 3710 | 3720 

* * * * ** * | * ** |** 

GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e >



>MslI
I 

3730 3740 3750 3760 3770 3780 

+ * * I * * * * * + * * * 

AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e >

>ScaI 
I 

3790 3800 I 3810 3820 3830 3840

* * * * *l* * * + + *.* 

AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG 

a a AMP-ORF a a > 

" e e PUC19 BACKBONE e e > 



>BsiEI 

| 3850 3860 3870 3880 3890 3900 

* * **'**■ * + * * * * 

GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 

>Psp1406I
I 

>DraI >BsiHKAI >XmnI
I I I 

i 3910 | 3920 1 3930 3940 3950 3960 

* * * * * * * * 

TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC

a a AMP-ORF a a > 

e e PUC19 BACKBONE e e > 

>Eco57I 
I 

>ApaLI 



>BssSI 
I 

>BsiHKAI
I

3970 3980 3990 4000 4010 4020

* I * *. * * *
GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT
a a AMP-ORF a a >
PUC19 BACKBONE e



4030 4040 4050 4060 4070 4080 



* 



TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG 
a a AMP-ORF a a > 



PUC19 BACKBONE e 



>MslI >EarI >SspI 

I I I 

4090 I 4100 4110 4120 I 4130 4140

* + *j* * * * * | * * * . * 

AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG 

a AMP-ORF a > 

e e PUC19 BACKBONE e e > 



>BspHI >BsrBI
I I 

4150 4160 1 4170 4180 4190 4200 

* * * *|*|* * * + . + * * 

CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA 
e e PUC19 BACKBONE e e >

>HincII 
I 

>AccI 
I I 

>AatII 
I I 

>SalI 
I I I 

4210 4220 4230 4240 I I I 

* * * * * * * * * I I I 
ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTC 

e PUC19 BACKBONE e > 


